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ABSTRACT 

Tii.ls project v/as concerned with the development of mathematical 
models of elementary mathematics learning and perf orrnance . Prohabi- 
listic finite automata and register machines vith a finite niimher of 
registers v/ere developed as models and extensively tested vith data 
arising from the elementary-mathematics strand ctUTicul-um the Institute 
for Mathematical Studies in the Social Sciences has developed over a 
period of more than five years- This curriculum is delivered to student 
in schools at teletype terminals in a computer-assisted- instruction mode 
"by telephone lin^s connected to the Institute's computer at Stanford « 

Probabilistic automata were defined and tested for basic addition, 
subtraction and multiplication exercises. An extensive report of this 
work is to be found in the recent book by Patrick Suppes and Mona 
Morningstar entitled Computer - assisted Instruction at Stanford^, I966-68: 
Data , Models , and Evaluation of the Arithmetic Programs, 



Detailed application of the register machine models is completed^ 
but the results are as yet unpublished and will appear subsequent to 
the date of this final report. 

The results of these analyses indicate that the kinds of models 
developed can be Applied to learning and performance in elementary 
mathematics, and have implications for detailed pedagogical procedures 
of instruction in these basic skills. 



Pinal Report 



Project No. 0-0115 
Grant No. OEG- 9- 70- 002^- 057 



Mathematical Models of Elementary Mathematics 
Learning and Performance 



Patrick Suppes 
Stanford University- 
Stanford^ California 9U505 



February I975 



The research reported herein was performed pursuant to a grant with the 
Office of Education^ U.S. Department of Healthy Education^ and Welfare 0 
Contractors undertaking such projects under Government sponsorship are 
encouraged to express freely their professional Judgment in the conduct 
of the project. Points of view or opinions stated do not^ therefore^ 
necessarily represent official Office of Education position or policy. 



U.S. DEPARTMENT OF 
HEALTH^ EDUCATION^ AND WELFARE 

Office of Education 
National Center for Educational Research and Development 



T.ABLE OF CONTEI.'TS 

Pag 



List of Tables 1 

List of Figures .2 

I. Introduction " y 

II • Methods 6 

III. Results 19 

IV „ Conclusions " 2k 

V, Bibliography - 25 



LIST OF TABLES 



Table Page 

1 Pretest Exercises in Coliomn Addition 20 

2 Maximum- likelihood Estimates of Parameters for 

the Three-^ Four-^ and Five -parameter 

Addition Models ■ 22 



EKLC 



LIST OF FIGURES 



Page 



Figiare 1. Proportion correct per problem for data 
and models. 



ERIC 



I . IIU'x^ODUCTION 



The psychological study of arithmetic skills^ like most other 
parts of psychology, has a relatively recent history--or.ly a few svc?- 
tematic studies were made "before I89O. The real impetus vas provided 
by E. L. Thorndike*s analysis of the learning of arithmetic in his 
Educational Psychology (1915? 191^) and later in his The Psychology 
of Aritlynetic (1922) , In an attempt to account for the acquisition 
of arithmetic skills in terms of his three psychological laws--the 
lav of readiness, the law of exercise and the law of effect--he tried 
to justify and analyze the reason for the traditional importance attached 
to drill and practice in arithmetic; for him the psychological purpose 
of drill is to strengthen the bonds between stimuli and appropriate re- 
sponses « He moved on from such fundamental questions to the more prac- 
tical ones of amount and distribution of practice. He emphasized the 
advantages of distributed practice and criticized the actual distribution 
of practice in textbooks of his time. Some effects of his work on the 
revisions of textbooks in the 1920s and later are doc-umented in Cronbach 
and Suppes (1969, PP. lOJ-llO). 

In the twenties and thirties there were a large number of good 
empirical studies of arithmetic skills, many of which were concerned with 
detailed questions that had to be answered in any complete psychological 
theory of arithmetic. For example, Buckingham (1925) studied student 
preferences and aptitudes for adding up or down in column addition 
problems- An extensive review of this literature may be found in 
Suppes, Jerman and Brian (1968). 

Empirical studies like those of Buckingham were not designed to 
develop an overall theory of arithmetic skills: nor, it is probably 
fair to say, vas Thorndike completely sensitive to the gap that existed 
between his theoretical ideas and the actual algorithms students were 
taught to solve problems. There are many stages to vork through : 1 
developing an adequate theory, and there is no one point at vhich one 
can say the theory is nov complete in all respects. If, for example, 
the theory is adequate at" sane conceptual level of information proces- 
sing, then it is possible to move on to additional perceptual questions. 
Moreover, once a perceptual theory of a certain level of abstraction is 
successfully developed, it is possible to go on to still more detailed 
perceptual questions, such as requiring the theory to include eye move- 
ments of students as veil as their numerical responses. It is an impor- 
tant methodological precept that at no foreseeable point shall ve reach 
a fixed and firm bottom beyond vhich ve cannot probe for fui^ther details 
and a more refined theory. 

As a background for the vork done under this project, ve briefly 
sketch the history of vork in the Institute for Mathematical Studies in 
the Social Sciences at Stanford over the past six or seven years on the 
psychological study of arithmetical skills. Rather than attempt a gen- 
eral coverage, ve shall concentrate on a single example — the simple one 
of column addition- -to illustrate hov ve have tried continually to deepen 



-^he theory^ Tlie methods and resiUxs cited for this sirr.ple case are 
extended in the bibliography of technical p^jblicationc to other 
arithmetical skills* 

It also should be mentioned that the data used for t^:-^'t.lng tiio 
models have all been collected as part :.he Institute activitic-:^ 
in computer-assisted instruction in elementary mathematics « The part 
of this vork that was supported by the c^ai^rent project was e-specially 
directed to t he elementary mathematical skills or disadvantaged stu- 
dents. Several of the publications^ especially Suppes and Mornin?-;star 
('1970)^ refer to early phases of this woi-k. 

The first question ve tried to answer was how can one predict r.he 
relative difficulty of different exercises of column addition*: for 
example^ we consider problems up to the size of three colmns and tliree 
row:3^ we are confronted with approximately one billion problems. A 
meaningful theory must drastically reduce this large number of exercises 
to a small number of classes in which all members of a class are essen- 
tially the same in difficulty. 

Our first approach (Suppes, Hyman & Jerman^ 196?) was to identify 
a small mjimber of structural fe^atures that would permit us to apply 
linear regression models to predict either probability of correct re- 
sponse or expected latency of response. Additional applications of 
such regression models may be found in Suppes^ Jerman and Brian (I968) 
and Suppes and Morningstar (1972). The application of such regression 
models is exemplified in Section III of this report. As can be seen 
from the information given there^ the fit of the regression model to 
mean student-response data on column addition exercises is not bad. 
Conceptually^ however^ there are obvious lacunae o The regression model 
that predicts response probabilities does not really postulate a specific 
process by which students apply an algorithm to solve an exercise. 

The next level of theory developed is aimed precisely at offering 
such process models » The natural theoretical tools for providing procesi; 
models of algorithmic tasks are automata^ and for most of elementary 
arithmetic^ simple finite automata are satisfactory. There ±s^ however, 
one weakness in finite automata as ordinarily defined^ namely, they xhave 
no place for a probabilistic theory of error^ so the natural step is to 
move from finite deterministic automata to probabilistic automata. 

An automaton becomes probabilistic by making the transition func- 
tion from state to state probabilistic in character. Thus^ from a given 
input and a given internal state there is a probability of going to any 
one of several different states. In general one wants to make the output 
function probabilistic also. This means that given an internal state and 
an input there is a probability distribution over the next, output. (These 
ideas are made formally definite in Definitions 1 and 2 of Section II.) 
By drastically reducing the source of error to a small number of param- 
eters^ we can develop and apply manageable probabilistic automata to 
student-response datae 



Such a prolDat'ilistic automaton riodel takes a definite step beyor.d a 
regression model in providing in an abstract sense an adequate informa^i on- 
processing model* Prom a psychological standpoint^ on the othei* hand^ the 
automaton models are unsatisfactory in that they lach any perceptual com- 
ponents^ and therefore they do not deal directly vith how the student 
actually processes the format of i.^^itten symbols in front of him. 

Our final effort as part of this project vas very riuch directed at 
this point. In principle^ it would be possible to continue the develop- 
ment of automaton models vith an abstract concept of state to represent 
the student's perceptual processing. A weakness of this extez^.sion of 
the automaton models is that when the states are left in a general ab- 
stract formulation it is natural to end up designing a different autom.aton 
for each of the different tasks in elementary mathematics^ and a plethora 
of models results o Closer examination of the algorithmic tasks of arith- 
metic facing the student in solving exercises indicates that the various 
tasks have much in common. This commonality suggests a sonewhat differ- 
ent approach^ an approach via register machines with perceptual instruc- 
tions c 

Register machines were first introduced by Shepherdson and Stm^gis 
(1965) to give a natural representation of computable functions in terms 
that are closer to the idea of a computer accepting instructions than to 
a Turing machine. In the case of the representation of computable func- 
tions^ a rather simple set of arithmetic instructions is sufficient. In 
particular^ an unlimited register machine hac a denumerable sequence of 
registers, but any given program only uses a finite nmber of these reg- 
isters and the machine accepts six basic instructions: add one to a 
register, subtract one^ clear a register, copy from one" register to another 
and two jtimp instructions, one conditional and one not. (This set of six 
instructions is not minimal^ but it is convenient-) Obviously, for the 
perceptual processing that a student does we want a different register 
machine and a radically different set of instructions. In addition, it 
is natural to postulate only a finite fixed number of registers that 
the student can usee 

The basic idea of this approach is to drastically simplify the per- 
ceptual situation by conceiving each exercise as being presented on a 
grid. The student is represented by a model that has instructions for 
attending to a given square on the grid; for example, in the standard 
algorithms of addition^ subtraction and multiplication we begin in the 
upper right-hand corner and then have instructions to move downivard 
through each column and from right to le-ft across columns. Additional 
instructions for storing the results of ,an operation, for outputting 
the last digit of a stored, ntimeral, etc., are needed. Some further 
details, are given in Section II, but the discussion is not as complete 
as that for automaton models. 

The basic idea of register machines is that the different algorithms 
are represented by subroutines. One subroutine may be called in another, 
as complex routines are built up. The procedure is familiar to most of 



> For exsL^ple^. in p^-r forming cclv^Tin muit.ipli ration v;e use the ai,-:..- 
rl^hni of addition, which in thic ca^e means calling the subrcvine f;::- 
addition; in long division we call the- £>-ub.^outine£- for subtractl::: a*:": 
^ul-^^iplication, a^^- well as for adaption, Each oaslr 3ubr out.-j.e I" r-rp- 
r-^^ffnt-^d by a program m terns of thz- primitive in-^trucrion^ ■ Tl:e pvob- 
lem from a psychological standpoint is to find instructions ohat i:^rjvi.>o 
not only a realistic description of what the student does, a desc^'inticn 
that can be fitted to data in the same way t.hat the automaton monels have 
b'-en applied to data, but also a fuller account of how the student pro- 
Cvo.r^^^s the exercise. 

the first stage of analyzing register machine models we can get 
rr-sult'-. similar to those for the automaton models- by postulating error 
parameters for execution of mam subrout-ines of the rou'-ine for a .-^Iven 
algorithm. In adaition to providing an explicit analysis of porcopT.ual 
proce-.^sing, the register machines provide a natural device for anatyr.in:7 
i -arnmgc However, the aet'-ailed and technical extension of register 
marhines t;o learning was not made during r,he couj'se of the present proj- 
ect, but Is work planned for the future. 

In Section IT we describe the theoretical and empirical methods 
used m the. project and give a brief;, but technical, description of 
th^r: most, important models tested. Finally, in Section 111 ve sicmnarize 
the results of the test of the models, but because of the great techni- 
cal detail of the results this report provides only a summary, ana we 
r..r.r the reader to publications that have already appeared or are 
forthcoming, if he wishes to p'ursue the details c 



II, ME'^^GLS 

a:. though the b"alk of the research conducted under this project 
wa>. co-.cerned with probabilistic a"atomaton and register machlne'models, 
w^ extended our earlier work on linear regression models and begin thl^. 
di:.cussion of the models with a presentation of the linear r e .v.- io>n 
model for col'omn addition reported in Suppes and Morningstar ?19:'2;o 

Linear regression models . We began with regression models thai', 
use as independent variables structural features of Individual arith- 
metie: exercises. We denote the jth structural feature of exericst.- i 
in a given set of exercises by f^^. 'Vhe parameters estimated from the 
daf;a are the values attached to each structui^al featui^ec (in previous 
publications we have referred to these structural features as factors. - 
but this can lead to confusion with the concept of factor as used in 
factor analysis o) We denote the coefficient assigned to the j"^'^ 
structural feature by cy^, and we emphasize that the structui^al fea- 
tures themselves^ as opposed to their coefficients, are objectively 
identifiable by the experimenter in terms of the exercises themselves, 
independent of the response datao 




Let Pj^ "be the olDserved proportion of correct responses on exercise 
i for a given group of students. The natural linear regression in terms 
of the structural features f . . and the coefficients a. is simply 

Unfortunately, when the regression is put in this form, there is no 
guarantee that probability will be preserved as the structural features 
are combined to predict the observed proportion of correct responses. 
To guarantee conservation of probability, it is natural to make the 
following transformation and to define a n^w variable z^. 

1 - p. 

(1) 2 = log — — i , 

and then to use as the regression model 

(2) z. = Z c^.f . . 

The nmerator of equation (l) contains 1 - p. rather than p., so 
that the variable increases monotonically rather than decreases 

monotonically with the magnitude of the structural features f... 

In Chapter 5 of Suppes and Morningstar (1972), the following 
structural features were defined for column-addition exercises. 

ffhe . feature SUMR is the ntimber of colmns in the largest addend. 
For Jihree-row exercises SUMR is defined as I.5 times the nmber of 
coTtimns, plus .5 if a column sum is 20 or more. For example, 

a 

SUMR ( j_b ) = 1 
c 



a 

• SUMR ( ^ i 
de 

SUMR (ab + c = de) 



1.5 if de < 20 
if de > 20 



This structural feature reflects the nmber of colmns of addition, 
with greater weight being given to colmns in three-row exercises 
than in two-row exercises « 
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The Second structiu^al feature is CAR;, vhich represents the n-uinber 
of tiraes the of a column;? including any numbers carried to it, 
exceeds nine« For example. 



if b + d < 9 



if b f d > 9 



if b + d f < 9^ a c f e > 9 
if b 4- d f > 9^ a f c f e > 9 « 

The third structural feature VF reflects the vertical format of 
the exercise » The vertical exercises with one-diglfc responses were 
given the value 0« Multicolumn exercises with, mulbidigit exercises 
and one-col-Qinn addition exercises with a response of ].l were given 
the value 1, One-column addition exercises with a multldigit response 
other than n were given the value 3. For example^ 

ab 

V.F ( -_cd ) r.- 0 
e 

abc 

VF ( i^def ) 1 
""ghi 

a 

VF ( + b ) . 

This str'^ct'Ural feature is meant to reflect the likelihood of the 
mistake of reversing the digits of -the correct response^ especially In 
a one-column addition exercise « In the computer-assisted instruction 
environment vhere students were responding at teletype terminals, 
responses to vertical exercises were typed from right to left, while 
responses to horizontal exercises were typed from left to-rightio Thus^ 
it was possible for a student to have in mind the correct answer^' but 
to err by typing the digits in the reverse order « It is fair to say 
that t,hls structural feature i^ of more importance in working at a 
computer-based terminal than when using paper and pencil. 



GAR ( i^b ) - 0 
c 

CAR (a + *b cd) - 1 
ab 

GAR ( j_cd ) ^ 
ef 



ab 

ghi 
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Three - state auto: ' .e central weakness of the regresGion 

models is that they a )dels. They do not provide ^ a 

schematic analysis o.: iuuic steps the student uses t. : ^.^ an 

answer. Automaton model-^ process models and therefore theii; " rep- 
resents a natural extension of the regression analysis. For the exercises 
in. column addition we may restrict ourselves to finite automata^ but as 
ordinarily defined they have no place for errors. However^ this is easily 
introduced by moving from deterministic state transitions to probabilistic 
ones- 

We begin with the definition of a finite deterministic automaton^ 
and then generalize. 

Definition 1. A structure SI = (A^ V^^, V^^M^Q^ s^) ±s a finite 
( deterministic ) automaton with output if and only if 
(i) A i£ a finite J nonempty set ^ 
(ii) Vj and Vq are finite nonempty sets ( the input and output 
vocabularies , respectively ) ? 

(iii) M i;S a function from the Cartesian product ' A X Vj to A 
(M defines the transition table ), 

(iv) Q l£ 3; function from the Cartesian product A X Vj to Vq 
(Q the output function ) ^ 

(v) Sq i£ in A (sq Is the initial state). 

As an example of a finite automaton with output^ that is^ a finite 
automaton in the sense of this definition^ we may characterize an autom- 
aton that will perform two-row coliomn addition. 

A := {0,1} . 

- {(m^n) : 0 < m, n < 9} , 

Vq - {0,1. -.,93 , 



M(k,(m,n)) =1 



0 if m + n + k<9^ 

1 if m+n+k>9,, for k = 0,1 ^ 



Q(k, (m,n)) = (k + m + n) mod 10 , 
Sq = 0. 

Thus the automaton operates by adding first the ones^ column, storing 
as internal state 0 if there is no carry, 1 if there is a carry, 
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outputting the s\m of the ones* coluimn modulus 10, and then moving on 
to the Input of the two tens* column d'^^its^ etc. The initial internal 
state Sq is 0^ because at tiie begir.. of the exercise there is no 
* carry* , 

Definition 2. ' A structure SU = (A^, V^^V^^p^q, s^) is a (finite) 
probabilistic automat on if and only if 
(i) A a finite J nonv^-^mpty set j 
(ii) and are finite, nonempty sets ^ 

(iii) P is a function on A X V X A to the interval [0,1] such 
that for each s jjn A and cr in p is a probability density 

over Aj i-^"^ 

(a) for each s * in A, p (s * ) > 0^ 



(b) Z p^ (sO = 1, 



s*gA 



(iv) 



q j£ a function on A X X to ' [0,1] such that for each 
s in A and cr in V, q. i£ a probability density over ^ 

(v) is in A. 

In the probabilistic generalization of the automaton for column 
addition^ the nmber of possible parameters that can be introduced il^ 
uninterestingly large. Each transition M(k, (m,n)) maybe replai^ii 

by a probabilistlir transition 1 - e. and e, . and each output 

K,m,n k^m,n^ ^ 

Q(k(m^n)), b^ IC OTobabilities for a total of 2200 parameters. 

A three -parss^::.er automaton model structurally rather close to the 
regression model x:^ easily defined. First^ two parameters, e and t], 
are introduced acrordlng to whether there is a * carry* to the next 
columno 

P(M(k^(m^n)) = o|k + m + n<9)=l-.e 

and 

P(M(k, (m,n)) =T7 k + m+n>9)=l-Ti. 

In other words, if there is no * carry*, the probability of a correct 
transition is 1 - e and if there is a * carry* the probability of such 
a transition is 1 - t^. The third parameter, 7^ is simply the prob- 
ability of an output error. Conversely, the probability of a correct 
output is: 

P(Q(k, (m,n)) = (k + m + n) mod 10) = 1 - 7 „ 
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Consider now exercise i with carrys and D^^ digits e If • 

\re ignore the probability of two error r. leading to a correct response 
(e.g.^ a transition error followed by an output error )^ then the prob- 
ability of a " ^ct answer is just 

D • C D - C • - 1 

(5) P(Cr }ct , to Exercise i) ^ (l - 7) ^(l - 11) '"^(l - g) ^ ^ 

As already indijated^ it is important to realize that this equation is 
an approximation of the 'true^ probability* However^ to compute the 
exact probability it is necessary to make a definite asstmiption about 
how the probability 7 of an output error is distributed among the nine 
possible wrong responses. A simple and intuitively appealing one-paramet 
model is the one that arranges the 10 digits on a circle in natural order 
with 9- next to 0^ and then makes the probability of an error j steps to 
the right or left of the correct response 6 J . For example, if 5 is the 
correct digit^ then the probability of responding h is 6^, of 3 is 5^^ 
of 2 is of 1 is of 0 is of 6 is 6^ of 7 is 6"^^ etc. 

Thus in terms of the original model 

7 - 2(& + 6^ + &5 ^ 5'^) + 55 ^ 
Cons let er now the exercise 

'+7 



Then., where d^ = the i'-^ digit response^ 

P(d^ = 2) = (1 - 7) , 

P(d2 = 6) = (1 - 7)(1 - Ti) +.T1& . 

Here -i3ie additional term is x]^ ^ because if the state entered is 0 
rathsi than 1 when the pair (7^5) is input ^ the only way of obtaining 
a corre=X!t answer is for 6 to be given as the sura of 0 + if. + 1, v^hich 
has probability 5o Thus the probability of a correct response to 
thi£L::s^rcise is (l - 7)[(l - 7)(l - ri) + ti&] , Hereafter we shall 
ign^r^^he (or eS) terms. 

may get a direct comparison with the linear regression model 
if Tse i:ake the logarithm of both sides of (5) to obtain: 

(h) l3g p. = D. log (1-7) + C^ log (1 - T)) + (D. - C. - 1) log (1 - g) , 

and estimate log 1- >^ log 1- x]^ and log 1- e by .regression with 
the a^itive constant set equal to zero. We also may use some other 

apprcsach to estimation such as minimtira ^ or maximum likelihood. 
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The automaton model naturally suggests a more detailed analysis of 
the data- UnliKe the regression model, the automaton provides an imme- 
diate analysis of the digit-by-digit responses* Ignoring the eS-type 
terms, we can in fact find the general maximum- likelihood estiinate:^ of 
7^ £p and t] wh-->n the response data are given in this more explicit 
f orm« 

Let Lucre be n digit responses in a block of exercises. For 
1 < i < n let be the random variable that asstunes the value 1 if 

the i"*^^ response is corjrect and 0 otherwise. It is then easy to see 
that 

if i is a ones '-colTJunn digit, 

e) if it is not a ones* column and there 
is no carry to the i'^h digit^ 

T]) if there is a carry to the i^^ 
digit. 

Similar Ir for the same three 



7)(1 - 
7)(1 - - 

So for a string of actual digit responses x-^, ..<.,Xj^ we can write the 
iikelihood function as: 

05) Ji(x^,...,x^) = (l-7)V(l-e)''(l-.Ti)^[l- (l-7)(l-e)f[l-(l-7)(l-Ti)]' 

Where a = number of correct responses, b = ntmiber of incorrect responses 
in the ones' column, c = nT^rrrlter of collect resp>;>3!ses not in the ones* 
column when the internal stsnt;^ is 0, d = number* af correct responses 
when iihe internal state is 1^ e = ntmiber of incorrect responses not in 
tke ones' --^olumn when the internal state is 0^ and f = number of incor- 
rect respoiises when the inteamal .state . is 1, (in the model statistical 
±ndepeiiiis;isH:e of responses is assured by the correction procedureo) It 
is more -^nvenient to estimate 7'=l-7^ e*=l-e, and ti' = 1 - -q. 
Making tiis change, taking the logarithm of both sides of (5) and dif- 
ferentiating with respect to each of the variables, we obtain three 
equations that determine the maximum-likelihood estimates of 7^;, 
and r\ ' : 

_ b ee/ " fTi^ ' 



P(x. = 1) = { 



(1- 7) 
(1 - 7)(1 



(1 - 7)(1 - 



granted tbat €6-type terms are ignored. 
-.Iternatives 



1} 
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P(x. = 0) - ^ 1 - (1 
1 - (1 
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Sit - c' " 1 - " 



i 



Scoirixrii these equations, we obtain as estimates: 

! a - c ■ d 
'^"a + h- c- d^ 



^ t ^ c(a + b - c - d) 
^ (c + e)(a - c - d) ^ 

I ^ - C. - d) 

^ '(d"^-+"'f]7i! - c - d) 



A"— " ~ - data using these estimates i:. discussed in the next secoi:::. 



? c-jz paj; 'ajTie t e r and five - par aine te r £':itomatrj: r:::Ddels . As a result 
pj'i-rlamriiarT xrork with the data selectee ~o test ~- model.', we consldored. 
rrc ^^n:.ansl::riS of the basic automaton Thi-^- fllx'st ±c a four- 

TSi3i!^/jsr Tnciniii^l that generalizes the h^ziz three— cHrrrsreter model to 
.arrraiLnit foi' snme contextual effects thz^ -?7ere obaerv^, in student re- 




to addition problems with consB'^rrtive csrr^." a- .:, no-carry receiving 
The Second model generalizes tLie four -par airir^^er model fui-^thei 



tro .*r.^sBn.' t fifth parameter » This extension allows tbi:- model to treat 
^ ^■ - -rrn r. ■/ ' imns thai: consist of only one digit di:~ferr-ntly from normal 
c::f— rjanni Liiat "onsist of pairs of digits to be adoBdo 



Tte four~^3axaineter model is constructed by 5p_itta:ng parsoneter 
def::l- -c. /^i?OYe, _±Qto two parameters derrated by £q and This i£ 

dcrr- listlxigu±sh addition columns -d^zh^ when correctly solved, do 
nc^ eive 2. carry from an iramediatelsr preceding c-^limii , although r.iiat 
prv- zszT- does receive a carry. Thuc., the model tiow reflects the pos- 
sx_... .nfluBnce of the present state ( .axry) upon a student's tendency 
fc r.^t- '^ate unnecessary carrys^. The reasons for tiiis inodification are 
IDE: ..earer when we discuss the analysis of the data- Parameters 7 
a:. are retained as they \rer.e defined in the three -parameter model « 

IElc: j.zi-i'initi :2ti x)f £ is replaced by 

vL-- ^ -S the carry (0 or l) to coltmin i of the problem „ 

Zl:i' i' :r-pararaeter model is also a two- state majhine, but with one 
tran:,,_-ioii probability now a function of the state value. 
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The probability that any given addition problem will be solved 
correctly may be written: 



(6) P(A11 Answer Digits Correct) 

= (1 - 7) (1 ' rj (1 - Cq) (1 - e^) ^ 

where D and C are the ntimber of digits and carrys a3 defined earlier 
and E is the ntunber of coltunns in the problem that do not receive a 
carry^ but which are preceded by a coltunn that does.. Vie continue to- 
assume that all errors occur independently and yet do not combine l;o 
produce correct answers. This assumption applies to all the automaton 
models we discuss 0 

We shall not reproduce the details of the likelihood equation for 
the four- parameter models These are given in Suppes and Morningstar 
(1972, p. Ihh). 



The five-parameter model is directly c-^-ined from the preceding 
models by splitting the output-error probat.JLity 7 into two components 



The generality of the f oui^-parameter model .^tid its parameters e 
and T] are retained^ but we add: 



0^ 



\1-' 



70"= "the probability that an output er 
only one digit is printed in the column to 
preceding column is the only source of outp../. 
appear in the column; 



occurs given that either 
^dded;, or a carry fron the 
=nd no printed digits 



7-j^ =^ the probability that an output errr 
than one digit is printed in the column to 
digit and a carry from the preceding columiL. :: 

This modif ica.tion allows the model to refle.*:- 
dents find it easier to process an addition : 
need do to get the correct answer digit is 
pearing in that column; this^ as opposed tc 
perhaps a carry- before obtaining an ansv/er^ 



ccurs given that more 
.,jLded, or that one printed 

ur , 

ne conjecture that stu- 
-■umn in which all they 
:opy the only digit ap- 
iing two digits and 



The probability that an answer to a gi^ 
may be written as: 



problem will be correct 



(7) P(A11 Answer Digits Correct) 



where D, 



and E are as defined earlier,, and S is the number of 



columns in the problem which inVolve the suzl of at. least two digits^ or 
one digit and a carry 0 Agaln^, for details cf estimation problems in 
this model^ the reader is referred to Suppes ^jid Mornxngstar (1972^ 
PP«lU7-lU8). 
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Register machines with perceptual instructions . To introduce 
greater generality and to deepen the analysis to include specific ideas 
about the perceptual processing of a column- addition exercise^ vc. moved 
on to register machines for the reasons already described in Section I. 

For column addition three registers suffice Ir -i- . ; analyi'ls- 

First there is the stimulus -supported register [SSJ thai holds an encoded 
representation of a printed symbol to which the student is perceptually 
attending. In the p-t-^sent case the alphabet of such symbols consists of 
the 10 digits and tte underline symbol A3 a new symbol is attended 

to, previously stor^u G;^Tnbols are lost unless transferred to a non- 
stimulus- supported register • The second register is the non-stimulus- 
Eupported register Tibs]. It providers long-term storage for computational 
results « I3ie thiri- register is the operation:: register [OP] that acts as 
3 short-t.3rm store:..,. .bx>th for encodings of external stimuli and for results 
of calcula~ions cHsrried out on thevCor±ents of other registers « It is 
alL^^o primar'ily noj^-stiimulus-suppor'bed^. 

As already sta.ted in the main text^ we drastically simplify the 
perceptiial situation by conceiving eadi exercise as being presented on 
a^id vith at most one symbol in each square of the grid. For column 
acn±Ltion number the coordinates of the grid from the upper right- 
hsnd corner. Thus.;, in the exercise 

■j 

15 

2k 

+ 57 



the coordinates of the digit 5 are (l^)^ the coordinates of h are (2^1)^ 
the coordinates of 7 are the cocrdinates of 1 are (l,2) and so 

forth^ with the ^irst coordinate being the rov number and the second 
being the c olumu "number « 

The restriclred set of instructions we need for column addition 
are the f ollowir^. .lOo 

Attend (a^b): 
( ±a^ ±b): 
Readin [SS]: 

Lookup [Rl] + [R2]:; 



Copy [Rl] in [R2.]: 
Deleteright [RJ : 



Direct attep±±on to grid position (a^b). 

Shift attention on the grid by ( ±a^ ±b)- 

Read into the stimulus- supported register the 
physical symtol in the grid position addressed 
by Attend. 

Look up table of basic addition facts for adding 
contents of register [Rl] and [R2] and store 
the result in [Rl]- 

Copy the content of register [Rl] in register 
[R2]. 

Delete the rightaos.t symbol of vegL^x^ r [R] c 
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Jump L: 
J*ump (val) R^L: 



Jump to line labeled Lo 



Outright [R]: 



Jump to line label ' T t ' ^ntent of . -.''iIjIl-v 
[R] is - 

Write (output) the rightmost f?:yTnbol of register 
[R] at grid position addressed by Attend. 



End: 



Teminate processing of current exercise. 



Exit: 



Terminate subroutine processing and retui^n to 
.next line of main program « 



Of the 10 instruct±ons only Lookup does not have an elementary character. 
In OUT complete analysis :it has the status of a subroutine built up from 
more , primitive operations such as those of countings It iS;. of course^ 
i-nore than a proble^i of constructing the table of basic addition facts 
from counting subroutineH; it is also a matter of being able to add a 
single digit to anj atiraber stored in the nonrstimulusrsupported register 
[M3S] or [OP]^ as, for example^ in adding many rows of digits In a given 
column. I omit the details of building up this subroutine. 

It should also be obvious that the remaining nine instructions are 
not a minimal set; fnr example, the unconditional jmp instruction is 
easily eliminated , He do think the nine are both elementary and psy- 
cnologically intuiti'^e for the subject matter at hand* 

To illustrate in a simple way the use of subroutines-, we may con- 
sider two that are useful in writing the program for column addition. 
The first is the vertical scan subroutine, which is needed for the fol- 
lowing purpose o In adding rows of numbers with an uneven nmber cf 
digits^ we ca^inot simply stop when we reach a blank grid square on the 
left of the topmost row. We must also scan doraward to see if there 
are digits in that colunn in any other roWo A second aspect of this 
same problem is that in our model the student is perceptually proces- 
s±ng oanly one grid squiare at a time,, so that he must have a check for 
zinding the bottom row by looking continually for an underline symbol- 
Ctherwise he could, according to an apparently natural subroutine,, pro- 
ceed ±ndefinitely far doraward encountering only blanks and leaving 
entirely the immediate perceptual region of the formatted exercise « 
Here ±s the subroutine « In the main program it is preceded by an Attend 
instmEictiono r 



Vertical Scan Subroutine 



^-scan (0-9^_) 
Rd 



J^mp (0-9._) Fin 
Attend i^l, -1) 



F-=^adin 
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Jtrnip (_) SS. Fin 
Attend (+0^+1) 

• J'>jn:.v lid 
Fin Exit 

The labels Rd and Fin of two of the "^ires are shown on the left. 

The second subrou:tine is one that outputs all the digits in a registt 
v;orking from right to left. For example^ in column addition^ after the 
leftmost colmn has been added^ there may still be several digits re- 
maining to print out to the left of this column in the 'answer' row. 

Output [R] 

Put Outright [r] 

Deleteright [R] 

Attend (O^+l) 

Jump (Blank) R^ Fin 

Jmp Put 
Fin Exit 



Using tiiese two subroutines the program for vertical addition is rela- 
tively straightforward and requires 26 lines... I number the lines for 
later reference; they are not a part of the program,. 



Vertical Addition 

!• Attend (l^].) 

2, Readin 

5* Copy [SS] in [OP] 

k. AirtenoL (+l,+0) 

5 «» ReaSin 

6. Opr Lookog [op] + [3S] 

To Rd Att:e!!iia (-KL,0) 

8. Readi^n 



9. cJtimp (0-9) SS^ Opr 



O 17 

EKLC 



10. 




Jump (Blank) SS^ Rd 


11, 




Attend (+1.,0) 


12, 




Outright [OP] 


15. 




Deleter Ight [OP] 


Ih. 




Copy [OP] in [NSS] 


15. 




Attend (l;,+l) 


16 „ 




V-sean {0-9, _) 


17„ 




Jump (_) SS, Fin 


.180 




Jump (0-9) SS, Car 


19" 




Copy [SS] in [OP] 


20, 




Jump Rd 


21 0 


Car 


Copy [NSS] in [OP] 


22, 




Jump Opr 


25 » 


Fin 


Jump (Blank) NSS^ Out 


2ii, 




Attend (+1/0 ) 


25. 




Output [NSS] 


26. 


Out 


End 



To shov how the program works, we may consider a simple one-column 
additdon exercise « We shew at the right of . each line tXEe content of 
each register just before the next row is attended to, l^^e,. after 
all operations ha.ve been performed. 





[SS] 


[OP] 




k 


if- 


5 


5 


9 


5 


3 


12 


8 


8 


20 






20 


0 


0 





This kind of analysis can be generalized to prove that the program is 
correct^ ioe*^ will output the correct answer to any colimn-acldltion 
exercise^ but this aspect of matters will not be pursued fia^riier here« 

By attaching error parameters to various segments zif the program^, 
performance models are easily generated. For comparattre purposes ve 
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may define a performance model essentially identical to the two- state 
probabilistic automaton already introduced for column addition restricted 
to two rows. To lines 6-12 we attach the output error parameter 7, 
and to lines 15-19 we attach the 'carry' error parameter x] if there 
is a carry, and the error parameter' e if there is not. Given this 
characterization of the error parameters the two performance models are 
behaviorally identical. On the other hand, it is clear that the program 
for the three-register machine is much more general than the two- state 
probabilistic automaton, since it is able to solve any vertical addi- 
tion exerciser It is also obvious that other performance models can 
easily be defined for vertical addition by introducing error parameters 
attached to different segments of the prograin. 

S tudents # To test the models outlined above, as already Indicated, 
we used students participating in the elementary-mathematics CAI programs 
of the Institute. This project supported 10 terminals located in 
Brentwood School in East Palo Alto. More than 80 percent of the stu- 
dents attending Brentwood School are minority students. Much of the 
data reported in publications listed below are dra™ from Brentwood 
School. In addition, the models described above were also applied 
toward the end of the grant period to data drawn from handicapped 
children participating in the GA.I network organized by the Institute 
to include a nmber of schools for deaf students . 



III. RESULTS 

We simnarize in this section the main, results. We emphasize, 
however, that the reader Vho is interested in the detailed testing 
of the models is urged to go to the publications summarized at the end 
of this section in which results are reported. For simplicity, we do 
not consider the testing of the register machine models but concentrate 
here on the linear regression models and the probabilistic automaton 
models . 

Linear regression models . Using the linear regression model 
described in Section 11^ the following regression was obtained for 
the mean response data of 65 third graders taking the pretest shown in 
Table 1. Complete experimental details about the students, etc.^ are 
given in Suppes and Morningstar (1972, Ch. 5)« 

p. = .55 SUMR. + .95 CAR. + .51 VF. - ho06 .. ■ 

ISie multiple R was .7^ and R^ was -.5^^ which reflects a reasonable fit 
t:.3 the data. Extensive data analysis of a similar kind for other basic 
skills, using similar regression models, is reported in Suppes and 
l^forningstar (1972). 

Test of automaton models for addition. Tests of the three-, four- 
and five-parameter addition models for third-grade addition. Including 



19 



TABLE 1 

Pretest Exercises in Column Addition 



1) 



2) 



17 
+ 2 



6 

D 



8) 11 

22 
+ Ik 



9) 27 
+ 



15) 5267 
+ 283 



16) 



75 
+ 23 



3) 



lU 



10) 



8 

+ 52 



17) 3986 
+ '4735 



7) 



6 

^12 



5) 363 
+ 21U 



6) U16 
+ 212 



12 

51 
+ 10 



11) 63? 
+ 212 



12) 66 
+ Ik 



13) 378 
+ 125 



Ik) 557 
+ 256 



18) 



20) 



27 

kG 
+ 88 



19) 7657 
+ 1875 



69 
56 
+ U8 
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the exercises displayed in Table 1, as well as similar fourth • grade- 
exercises, are shown in Table 2* Both sets of exercises were given 
both as pretests and posttests, and the corresponding parejneter esti- 
mates are shown in the table. 

The fit of the predictions to observed pi'oportion correct is showm 
in Figui'e 1. 

These data^ reported in a JO-^ag^e chapter of Suppes and Morningstar 
(1972);, were analyzed in conjunction with Mr. Alex Cannara, Interested 
readers are referred to Chapter k of that book. 

Publications ^ Research performed under this grant has been pub- 
lished in the following articles » 

Pa S-uppes and Mo Morningstar, Four programs in computer-assisted 

instruction,* In Wo Ho Holtzman (Ed.)^ Computer - assisted instr u ction ^ 
t(U ting ^ and guidance . New York: Harper "Sc Row, 1970* Pp. 253-265 « 

Po Suppes and Mo Morn Technological innovations: Computer- 

assisted instruction and compensatory education <, In Fo Korten, 
Se Cook & J. lacey (Lids, )^ Psychology and the problems of society^ 
Washington, C*: American Psychological Association, Inc.^ 1970o 
Pp. 221-2560 

Suppes « Computer-^assisted instruction at Stanfordo In Man and 
com puter o Proceedings of international conference, Bordeaux 1970* 
Bar.el: Karger^ 1972. Ppo 298- 55O. 

Po Suppes and Mo Morningstar c Computer - assisted instruction at Stanford, 
1966-68: Data 5 m odels ^ and evaluation of the arithmetic programs » 
New York: Academic Press^ 1972., 555 PP^ 

Po SuppeSo Facts and fantasies of educationo In Mo Co Wittrock (Ed*).? 
Changing schools : Alternatives from educationa l research o Englewood 
Cliffs, No Jo: Prentice-Hall^ 1975o "(The technical appendix of this 
article reports research conducted under this grant,) 



Publications in preparation o The principal investigator^ Patrick 
Suppes^ together with his collaborators in the Institute^ are in the 
process of preparing a book that will be a sequel to the Suppes and 
Morningstar (1972) book^ and that will report a number of detailed anal- 
yses of the models described in this reports There will be an especially 
long chapter on the register machine models written by Suppes and Lindsay 
Flannery^ a graduate student in the Institute o Part of the results on 
register machine models of Suppes and Flannery will also be written for 
publication in article form, probably in the Journal of ]\ fe.thematical 
Psychology o 
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0 4 8 12 16 20 

PROBLEM 



Fig, 1, Proportion correct per problem for data and models. 
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IV. COIJCLUSIOIIS 



Several conclusions emerge from this research over a period of 
several years on mathematical models of elementary- mathematics leai^ning 
and performance. We hreal-: these conclusions into four broad classes. 

1-, Emp iri cal Adequacy 

The V70rk indicated in this report^ but reported more extensively 
in Suppes and Morningstar (1972)^, and in other publications that have 
not yet appeared, demonstrates that the kind of process models using 
ideas from the theory of automata and the abstract theory of register 
machines have direct application to the empirical study of students' 
performance in elementary arithmetic skills o It is fair to say that 
the models developed as the main focus of this project are probably 
the most detailed process models that yet exist in the psychological 
or educational literature, insofar as the basic skills of elementary 
mathematics are concernedo From the work performed it is fair to con- 
clude that automaton models of the basic operations of addition^ sub- 
traction^ multiplication and division can provide a detailed account 
of student performance^ including especially the analysis of the 
typical errors that students make^ 

Qp^n Questions 

At the same time^ the research begun under this project leaves more 
questions open than solvedo The automaton models, which were most 
thoroughly tested empirically^ do not provide anything like a satis- 
factory account of the perceptual processing engaged in by students^ 
nor do they take adequate account of the actual real-time operations. 
The models are defined in terms of discrete time rather than continuous 
t±me^ and no perceptual apparatus is assumedc Extensions in both of 
these directions are needed in order to develop more adequate models* 
The first steps of incorporating some simple perceptual processing were 
developed as part of the register machine models described in Section II 
but these models represent only a very elementary and simple form of per- 
ceptual processingo The actual perceptual processing engaged in by stu- 
dents is obviously more elaborate and more complex in nature « 

5' Nee d for Learning Model s 

Although it was the original intention of the project to develop a 
vide range of learning models as well as performance models^ the details 
of the performance models developed and tested in this project were suf- 
ficiently complex rnd -^^ -obler -collecting adequate bodies of data 
were sufficiently ..iff tha'^. iii'. next steps to correspondingly de- 
tailed learnin/ :::odc a:^ aot takeno Some work on learning was described 
in Suppes (l975j^ and further work on learning will be reported in some 
of the unpublished work yet to appear^ but it is fair ^to sa^^, that the 
models as described above concentrate almost entirely on performance « 
The beginning of a complex theory of learning for finite automata in 
Suppes (1969) and in Rottmayer (l970) form a promising beginning but 
further work is needed for extensione 
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Applications 



ite of the shortcomings of the models developed as described 

ac automaton and register machine models developed already have 

pc" pplications in the organization of individualized cia-riciilum 

ir. -y mathematics, especially in a computer-assisted inr ^ rv... tier: 

se' 'L: . -rhaps the central criticism tliat can be made of most ..'ork in 

con --r^ ■ isted instruction at the present time is the relative sim- 

pl- uhe model of the student asstuned. The kind of models developed 

her-: - :)ugh still far too simple^ do take a definite step toward pos- 

tul :3re than a simple error rate and a simple latency of response 

in ..,-:ent, and do attempt to impose an internal structure on his 

pre of curriculm materials. As more deeply individualized ciu-- 

ric . attempted, the kind of models developed as' basic research 

unde • project should play a natural part in providing tools fri- the 

deveJ t of such highly individualized curriculum. 

otential of the approach is that presenting exercises to tlie 
studr . an individualized basis will no longer be based sirr'.ply on 

the rates and latency rates of the st'^adent, but on the estimate 

of c^-' - ■ ..fd parameters of a model of his irternal processing capacity « 
Err::i':'i- _.:l now be translated into error parameters that have a meaning- 
ful ir.: retation in a process model of the basic skills the student 
is 1^. Applications of such detailed models would seem difficult 

in cUs - -\^om settings, but it is considerably more practical when placed 
Ir 3. G^V ?tting^ and there is reason to hope that such models \n.ll be 
dv!:"yvji.ai~ luring the coming decade. 
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